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We propose a hierarchical logistic equation as a model to describe the dynamical behavior of 
a penetration rate of a prevalent stuff. In this model, a memory, how many people who already 
possess it a person who does not process it yet met, is considered, which does not exist in the 
logistic model. As an application, we apply this model to iPod sales data, and find that this model 
can approximate the data much better than the logistic equation. 
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I. INTRODUCTION 



How does fashion diffuse in our society? Fashion spreads out, although we, members 
of society, do not aim to do so: Most of us are not out to spread it, while some may 
have that aim. New stuffs that are somewhat of a curiosity at the beginning will become 
commonplace before we notice. Some of them may disappear from our life style. This 
phenomenon is essentially similar to various changes of phase in matter which we cannot 
imagine from an interaction between atoms or molecules. That is to say, the human being 
is "the social atom"|lJ. 

We want to clarify the mechanism producing the occurrence of fashion. In order to deal 
with this as a scientific problem, quantitative data indicating the extent of diffusing are 
necessary. Here, we shall employ a penetration rate. This has the following universality: 
Generally, penetration rates increase slowly at the beginning and, then, the growth reaches 
its maximum. Finally, the rates become saturated. This change in time is called the S- 
shaped curve. The logistic function, solution of the logistic equation, has often been used 
to analyze the rates. The logistic equation can be represented by the following differential 
equation. 

^ = ±z{t) {X - z{t)} (1) 
This solution is the logistic equation: 

x(t) = 7-^—, : • (2) 

If x(0) < X, then, x(t) is in an interval from to X and forms the S-shaped curve. Therefore, 
x(t)/X can be employed for penetration rates since it does not extend beyond 1. 

As is well known, the logistic equation was proposed as an equation describing a popu- 
lation growth with an upper limit by Verhulst (2 4 ] - However, the value of his study was 
not accepted in those days. In 1920, about a century later, Pearl and Reed rediscovered 



this equation while investigating the evolution of fly population [5]. Lotka also derived this 
equation as the model of population growth [6( . Their works excavated the logistic equation. 

Griliches made the first adoption of the logistic equation for the dynamical behavior of 
innovation diffusion [7]. He analyzed the penetration rate of hybrid corn among farmers 
by the logistic function. After that, Mansfiled justified using the logistic equation for the 



innovation diffusion mathematically [8j|. On the other hand, Fisher and Pry utilized this for 
a substitution of a share of two products, i.e., margarine and butter, {9]. 

In this way, the employment of the logistic equation for the innovation diffusion started 
and, then, this has been utilized for dynamical behavior of penetration rates for various 
stuffs: in the past decade, mobile phones [10l-ll2|. personal computers[13Nl7j . electronics 
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19 1 , energy technologies information technologies |l7(] and oxygen-steel making 



process 



22] . 



Thus, the change of many penetration rates in time can be described by the logistic 
function. But, what kind of human communication results in such a dynamical behavior? 
It is not a self-evident question. In this work, we shall unveil it first of all. According to 
our study, it is clear that the logistic equation applying to penetration rates supposes the 
following human communication: those who do not have a prevalent stuff start to possess 
it shortly after they meet people already possessing it, which will be verified by numerical 
simulation. 

Hence, a new question arises: Are we influenced by others so easily?, which is the real 
start-line of this paper. Therefore, we constructed a brand new model supposing more nat- 
ural human communication, which implies that we extend the logistic equation. Moreover, 
we adopt the total number of iPod sales as the real penetration data and, then, clarify that 
our model can describe the behavior much better than using the logistic equation. 



II. PENETRATION RATE IN AN IMITATING GROUP 

In this section, we unveil the human communication yielding the penetration rate which 
can be described by the logistic function. In addition, we confirm this by numerical simula- 
tion. 

Let us consider a group composed of iV people. For this group, we shall apply the following 
rules: i) At the beginning, some people have a stuff which will diffuse in this society, ii) If 
those who do not possess the stuff yet (non- adopters) meet people who already possessing 
it (adopters) , they start to adopt it at once, iii) A non-adopter is not influenced by more 
than one adopter and an adopter can not influence more than one non-adopter at the same 
time, iv) Adopters do not part with it. Such a group can be realized by considering people 
existing on lattices whose number is n x n (< N). We suppose that he/she moves to one 
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of the next lattices with the same probability at each step whose time interval is At. The 
third rule means that only one adopter and one non-adopter can share a lattice. Owing to 
these rules, we can think of empty lattices as those who do not interact with adopters and 
non-adopters: people who have no interests in the stuff. Thus, we can treat a group where 
there are adopters, future adopters and non-interested people. 

Here, we respectively set the number of adopters and non-adopters at ith step as Pi and 
Qi. Therefore, it is natural to consider that a probability to meet adopters or non-adopters 
is proportional to each number of them: 

Pi 



probability to meet adopters at ith step 

n 2 



probability to meet non-adopters at ith step = ^ 

n 2 



(3) 



Indeed, a probability to meet nobody at each step is (n 2 — Pi — Qi)/n 2 = 1 — N/n 2 , which 
also can be regarded as one to meet non-interested people. 

Therefore, (Pj/n 2 ) x Qi people of non-adopters become adopters at the next step, so that 
we can obtain the following recursion formulae: 

P l+1 =P t + ^Q, , (4) 
Pi 

Qi+i = Qi — ?Qi ■ (5) 

We shall define the number of them at t as P(t) — P(i- At) = Pi and Q(t) = Q(i- At) = Qi. 
Here, we take the limits as At — > and n — > oo with n 2 At fixed. By setting the fixed value 
as a/N and using P(t) + Q(t) = N, the following differential equation is derived: 

^- = ^{N-P(t)}P(t), (6) 

that is the exact logistic equation. The penetration rate p(t) = P(t)/N satisfies the following 
logistic equation 

^ = a{l-p(t)}p(t), (7) 
and this solution can be yielded as 

Pit) = — rr~^ — • ( 8 ) 
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Indeed, it is pointed out in Ref. jg] that the imitation is essential as the human communi- 
cation in a group where the change of penetration rate in time is expressed by the logistic 
equation. However, the above derivation helps us to reach the deeper comprehension . 



The parameter a, which is called the coefficient of imitation 17J, |22|, |23], determines the 
speed of the growth of the penetration. This can be expressed as 

v N 1 

a = hm — r — . (9) 

At-*> n 2 At v ' 

n— loo 

N/n 2 means the population density: The larger this value is, the faster the penetration rate 
grows. This is reflected by the simple fact that there are many encounters in the crowded 
society. 

Let us confirm these facts by using a brief numerical simulation. We have N players 
walking randomly onfixn lattices with periodic boundary condition. Note that this random 
walk is very simple and is different from that supposed on the above. In short, the rare case 
that a non-adopter can meet more than one adopter must occur. Furthermore, n 2 of this 
simulation is not large. Therefore, we have done 100 times independent simulations and, 
then, taken the ensemble average in order to negate the contribution from such a rare case. 

The results with N = 25 are show in Fig. [TJ with circles. In Fig. QJa) and (b), we set 
n as 25 and 18, respectively. The curves mean the logistic function 1/(1 + e~ a ^~ b ') with 
an arbitrary time unit. The parameters of the logistic function in Fig. [H(a) and (b) are 
(a, b) = (0.0457,56.4) and (a, b) = (0.921,27.8), respectively. The population density of (b) 
is about twice that of (a), which is consistent with the ratio of the parameter a as discussed. 



III. HIERARCHICAL LOGISTIC EQUATION 

It is clarified in the previous section that the dynamical behavior of the penetration rate 
depicted by the logistic function is based on imitation: People not possessing a prevalent 
stuff yet go buying it shortly after they meet people already possessing it. Here, most of 
us reach the same question: But are we really like that? In that rule, the definitive human 
psychology, when we imitate, is forgotten. That is, the memory, how many adopters we 
have met, is essential for us to start to possess the stuff 

In order to integrate this feature, hereby, we shall extend the rules of the group of random 
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FIG. 1. Penetration rates of the numerical simulations with 25 random walkers on n x n lattices 
shown by circles: (a) n = 25; (b) n = 18. The curves expresses the logistic function l/(l + e~ a<yt ~ b ^) 
with an arbitrary time unit. 



walker by the following way: We set the number of people starting to process the stuff after 
they meet /i adopters at i step as , in which we call /i as remaining adopters number 
(RAN). Indeed, if a non-adopter, whose RAN is fi, meet one of adopters, his/her RAN 
becomes /i — 1 at the next step. We do not alter other rules. Namely, we do not consider 
interactions between non-adopters despite the fact that the non-adopter gets more varied. 
If the maximum of RAN is m, the recursion formulae exchange into 

Pi+i = Pi + - 2 Q\ , (io) 
Qli = Ql-- 2 Ql + - 2 Qh (ii) 

QT+i = QT- ^QT • (12) 

By the previous continuation of time and space, we can obtain the following differential 
equations: 

^ = %Q\t)P(t) , (13) 

^ = ~^P{t)Q\t) + ^P{t)Q\t) , (14) 



= ~P(t)Q m (t) , (15) 



where Q»(t) = Q fl (i- At) = Q? . 



We shall call this the hierarchical logistic equation. Indeed, P(t) + Q 1 (t) + - ■ - + Q m (t) = N 
is always conserved. 

We can solve Eq. (|T5"|) easily: The solution is 



Q m (t) = Q m (0)exp 



-a [ dt'P(t') 
Jo 



(16) 



If Q m (0) = 0, Q m (t) is always 0. Then, the contribution of Q m into the differential equa- 
tion of Q m ~ x disappears, and so Q m ~ l can be calculated similarly. Therefore, if Q m (0) = 
gm-i(O) = . . . = Q2(o) = 0, Q m {t) = Q m -\t) = ■■■ = Q 2 (t) = 0, which means the normal 
logistic equation is recovered. Namely, the hierarchical logistic equation includes the normal 
one. 

If we use ratios of adopters and non-adopters to the total number N, the differential 
equations become 

dp(t) 



dt 
dq^t) 
dt 



aq\t) P (t) , (17) 
-ap(t)q l (t) + ap(t)q 2 (t) , (18) 



= -ap(t)q m (t) , (19) 



where q»(t) = Q»{t)/N. 



IV. FITTING THE IPOD SALES DATA 

Now, let us apply the hierarchical logistic equation to fitting a real data. As this data, 
we shall employ the iPod, created and marketed by Apple Inc., sales which can be obtained 



from the official Website, http://www.apple.com/ The amount of sales on a quarter is 
reported on the next quarter. Therefore, we consider the reported sales as on the middle of 
the previous quarter: the sales reported on the first quarter as on November in the last year, 
the sales reported on the second quarter as on February in this year and so on[? ]. Then, we 
plot the sales as a function of time in Fig. |2] where the first data is in November, 2001. As 
can be seen, there are six peaks after 2005. To our regret, the hierarchical logistic equation 
does not have the many peaks just like the logistic equation[? ]. Thus, we shall use the data 
from November on 2001 to May on 2006, which includes only one peak. We treat A as a 



fitting parameter, because the number of sales is not saturated and so we cannot obtain N 
from the data. 




FIG. 2. (color online) iPod 
sales data obtained from Ap- 
ple Inc.'s official Website, 
http://www.apple.com/, Each 



year includes four quarters, Febru- 
ary, May, August and November. 
The same (color) gray-level means 
the same quarter. The first bar is 
the data on November, 2001. 

Setting November on 2001 as the origin of time, we construct the cumulative sales and, 
then, we fit the data with the hierarchical logistic equation. If we minimize the residual 
sum of squares or the sum of the absolute value of error (SAE) when fitting this data, the 
solution of the hierarchical logistic equation does not match the data with small values. As 
an example, we show the cumulative iPod sales and the logistic function with parameters 
minimizing SAE in Fig. |3j The disagreement for small values can be seen from Fig. [3](b) . 
This results from a feature of the logistic function that P(t) must be small in order to make 
the growth of it, P(t), small. 





year 



year 



FIG. 3. Cumulative iPod sales expresses by circles and the logistic function with parameters 
minimizing SAE. 

Therefore, we shall minimize the product of SAE and the sum of the absolute value of 
relative error (SARE). The results are show in Tab. [I] The parameters except in the first 
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TABLE I. Parameters minimizing the product of SARE and SAE (or only SAE), SAREs and R 2 s 
by them. All values are rounded to a three-digit number, and so the sum of p(0) and p M (0) is not 
equal to one. 





m = 1 (logistic)* 


m = 1 (logistic) 


m = 2 


m = 3 


m = 4 


a 


1.89 


1.43 


4.42 


4.25 


4.17 


N [xlO 6 ] 


83.7 


1.28 x 10 5 


64.6 


65.4 


66.2 


p(0) 


4.80 x 1(T 4 


9.26 x 10" 7 


0.00207 


0.00200 


0.00189 




1.00 


1.00 


0.288 


0.311 


0.325 


9 2 (0) 






0.710 


0.675 


0.657 










0.0125 


0.00619 












0.00760 


SARE 


4.54 


1.84 


1.12 


1.02 


1.01 


R 2 


0.998 


0.959 


0.997 


0.998 


0.998 



*The parameters, with which the change of P(t)(= Np(t)) in time is shown in Fig. [3l minimize 

SAE. 



column minimize the product. On the contrary, those in the first column minimize SAE. 
SARE diminishes with increase of m: SARE with m = 4 reduces to nearly half that with 
m = 1. In other words, the average relative error for the hierarchical logistic model with 
m = 4, 5.3%, is about as half as that for the logistic model, 9.7%. For reference, we also show 
the coefficient of determination (R 2 ) in Tab. [H because this is employed on many papers in 
order to measure how well the logistic equation can approximate a real data. From Tab. [H 
R 2 is found to be not suitable for measuring the fitting accuracy: Those for the logistic and 
the hierarchical logistic model with m = 4 are nearly same. However, it is very obvious that 
the full curves in Fig. H] expressing the solution of the hierarchical logistic equation with 
m = 4 using parameters in Tab. [J approximates the data more precisely than the curves in 

Fig. m 

From the parameters with m = 4, we can find the following facts: the market size 
producing the first peak is about 66 million people; the ratio of the trend-conscious people, 
(^(O), is about 33%; the ratio of the cautious people, g 2 (0), about 66% and the ratio of the 
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more cautious people, g 3 (0) + g 4 (0), is about 1%. 

Comparing the logistic and the hierarchical logistic model, we show the result with m — 1 
and m = 4 in Fig. HJ The (purple) dashed and the (light blue) full curves represent the 
logistic and the hierarchical logistic model, respectively. The hierarchical logistic function 
with m = 4 matches the data which the logistic model cannot approximate. 




FIG. 4. (color online) Cumulative iPod sales expresses by circles and fitting curves. The (purple) 
dashed and (light blue) full curves are solutions of the logistic and the hierarchical logistic equation 
{m = 4) with parameters minimizing the product of SARE and SAE, respectively. 



V. CONCLUDING REMARKS 

In this work, we have unveiled the following fact that the essential human communication 
within a group, where the dynamical behavior of the penetration rate can be approximated 
by the logistic function, is imitation; non-adopters start to process a prevalent stuff shortly 
after meeting adopters. Indeed, this is not natural. Thereby, we have proposed the extended 
logistic equation, the hierarchical logistic equation, considering the memory of the number 
of adopters they met. In addition, we have applied this model to the change of iPod sales in 
time, and so the model has approximated the data much better than the logistic equation. 
As mentioned in the previous section, the logistic equation cannot describe a slow growth as 
seen in iPod sales in the early 2000's, but the hierarchical logistic equation can do so. The 
adopters of the hierarchical logistic equation have the inner structure, resulting in the slow 
growth of adopters. 

Wolf and Venus proposed an extended logistic equation describing a slow growth j^J . In 
their work, they introduced the delay time, ti, and multiplied 1 — exp(— t/ti) and the right 
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hand side of the logistic equation, Eq. ([T]). However, we emphasize that our model does not 
need to insert such a extra quantity. 

One of our conclusions is that the essential process for a stuff to spread is imitation. It 
is no doubt, however, that advertisements are also essential. The logistic model incorpo- 
rating this effect is the Bass model 25|. Therefore, as a future work, we shall produce the 
hierarchical Bass model by considering the memory on the Bass model. 
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